71 research outputs found

    Mutation patterns of amino acid tandem repeats in the human proteome

    Get PDF
    BACKGROUND: Amino acid tandem repeats are found in nearly one-fifth of human proteins. Abnormal expansion of these regions is associated with several human disorders. To gain further insight into the mutational mechanisms that operate in this type of sequence, we have analyzed a large number of mutation variants derived from human expressed sequence tags (ESTs). RESULTS: We identified 137 polymorphic variants in 115 different amino acid tandem repeats. Of these, 77 contained amino acid substitutions and 60 contained gaps (expansions or contractions of the repeat unit). The analysis showed that at least about 21% of the repeats might be polymorphic in humans. We compared the mutations found in different types of amino acid repeats and in adjacent regions. Overall, repeats showed a five-fold increase in the number of gap mutations compared to adjacent regions, reflecting the action of slippage within the repetitive structures. Gap and substitution mutations were very differently distributed between different amino acid repeat types. Among repeats containing gap variants we identified several disease and candidate disease genes. CONCLUSION: This is the first report at a genome-wide scale of the types of mutations occurring in the amino acid repeat component of the human proteome. We show that the mutational dynamics of different amino acid repeat types are very diverse. We provide a list of loci with highly variable repeat structures, some of which may be potentially involved in disease

    The Pancreatic Islet Regulome Browser

    Get PDF
    The pancreatic islet is a highly specialized tissue embedded in the exocrine pancreas whose primary function is that of controlling glucose homeostasis. Thus, understanding the transcriptional control of islet-cell may help to puzzle out the pathogenesis of glucose metabolism disorders. Integrative computational analyses of transcriptomic and epigenomic data allows predicting genomic coordinates of putative regulatory elements across the genome and, decipher tissue-specific functions of the non-coding genome. We herein present the Islet Regulome Browser, a tool that allows fast access and exploration of pancreatic islet epigenomic and transcriptomic data produced by different labs worldwide. The Islet Regulome Browser is now accessible on the internet or may be installed locally. It allows uploading custom tracks as well as providing interactive access to a wealth of information including Genome-Wide Association Studies (GWAS) variants, different classes of regulatory elements, together with enhancer clusters, stretch-enhancers and transcription factor binding sites in pancreatic progenitors and adult human pancreatic islets. Integration and visualization of such data may allow a deeper understanding of the regulatory networks driving tissue-specific transcription and guide the identification of regulatory variants. We believe that such tool will facilitate the access to pancreatic islet public genomic datasets providing a major boost to functional genomics studies in glucose metabolism related traits including diabetes

    Methanol fixation is the method of choice for droplet-based single-cell transcriptomics of neural cells

    Full text link
    The main critical step in single-cell transcriptomics is sample preparation. Several methods have been developed to preserve cells after dissociation to uncouple sample handling from library preparation. Yet, the suitability of these methods depends on the cell types to be processed. In this project, we perform a systematic comparison of preservation methods for droplet-based single-cell RNA-seq on neural and glial cells derived from induced pluripotent stem cells. Our results show that while DMSO provides the highest cell quality in terms of RNA molecules and genes detected per cell, it strongly affects the cellular composition and induces the expression of stress and apoptosis genes. In contrast, methanol fixed samples display a cellular composition similar to fresh samples and provide a good cell quality and little expression biases. Taken together, our results show that methanol fixation is the method of choice for performing droplet-based single-cell transcriptomics experiments on neural cell populations. Methanol fixation appears to be the method of choice for droplet-based scRNA-seq on neural cell populations, based on a broad analysis of how various fixation or preservation methods impact the single-cell transcriptomes of hiPSC-derived neural cells

    Discovery of Cancer Driver Long Noncoding RNAs across 1112 Tumour Genomes: New Candidates and Distinguishing Features

    Get PDF
    Long noncoding RNAs (lncRNAs) represent a vast unexplored genetic space that may hold missing drivers of tumourigenesis, but few such "driver lncRNAs" are known. Until now, they have been discovered through changes in expression, leading to problems in distinguishing between causative roles and passenger effects. We here present a different approach for driver lncRNA discovery using mutational patterns in tumour DNA. Our pipeline, ExInAtor, identifies genes with excess load of somatic single nucleotide variants (SNVs) across panels of tumour genomes. Heterogeneity in mutational signatures between cancer types and individuals is accounted for using a simple local trinucleotide background model, which yields high precision and low computational demands. We use ExInAtor to predict drivers from the GENCODE annotation across 1112 entire genomes from 23 cancer types. Using a stratified approach, we identify 15 high-confidence candidates: 9 novel and 6 known cancer-related genes, including MALAT1, NEAT1 and SAMMSON. Both known and novel driver lncRNAs are distinguished by elevated gene length, evolutionary conservation and expression. We have presented a first catalogue of mutated lncRNA genes driving cancer, which will grow and improve with the application of ExInAtor to future tumour genome projects

    研究課題

    Get PDF
    An Excel file containing information on lncRNAs probed for FM bias. The first sheet contains a README file with details on the contents of the other two sheets. The second sheet contains the list of all lncRNAs probed, a summary of their biological function, and the original source from which we extracted them. The third sheet contains the FM bias p value computed for each significantly FM biased lncRNA in each cohort. (ODS 13 kb

    A compendium of mutational cancer driver genes

    Full text link
    A fundamental goal in cancer research is to understand the mechanisms of cell transformation. This is key to developing more efficient cancer detection methods and therapeutic approaches. One milestone towards this objective is the identification of all the genes with mutations capable of driving tumours. Since the 1970s, the list of cancer genes has been growing steadily. Because cancer driver genes are under positive selection in tumorigenesis, their observed patterns of somatic mutations across tumours in a cohort deviate from those expected from neutral mutagenesis. These deviations, which constitute signals of positive selection, may be detected by carefully designed bioinformatics methods, which have become the state of the art in the identification of driver genes. A systematic approach combining several of these signals could lead to a compendium of mutational cancer genes. In this Review, we present the Integrative OncoGenomics (IntOGen) pipeline, an implementation of such an approach to obtain the compendium of mutational cancer drivers. Its application to somatic mutations of more than 28,000 tumours of 66 cancer types reveals 568 cancer genes and points towards their mechanisms of tumorigenesis. The application of this approach to the ever-growing datasets of somatic tumour mutations will support the continuous refinement of our knowledge of the genetic basis of cancer

    Transplanting rejuvenated blood stem cells extends lifespan of aged immunocompromised mice

    Full text link
    One goal of regenerative medicine is to rejuvenate tissues and extend lifespan by restoring the function of endogenous aged stem cells. However, evidence that somatic stem cells can be targeted in vivo to extend lifespan is still lacking. Here, we demonstrate that after a short systemic treatment with a specific inhibitor of the small RhoGTPase Cdc42 (CASIN), transplanting aged hematopoietic stem cells (HSCs) from treated mice is sufficient to extend the healthspan and lifespan of aged immunocompromised mice without additional treatment. In detail, we show that systemic CASIN treatment improves strength and endurance of aged mice by increasing the myogenic regenerative potential of aged skeletal muscle stem cells. Further, we show that CASIN modifies niche localization and H4K16ac polarity of HSCs in vivo. Single-cell profiling reveals changes in HSC transcriptome, which underlie enhanced lymphoid and regenerative capacity in serial transplantation assays. Overall, we provide proof-of-concept evidence that a short systemic treatment to decrease Cdc42 activity improves the regenerative capacity of different endogenous aged stem cells in vivo, and that rejuvenated HSCs exert a broad systemic effect sufficient to extend murine health- and lifespan

    Divergent mutational processes distinguish hypoxic and normoxic tumours

    Full text link
    Many primary tumours have low levels of molecular oxygen (hypoxia), and hypoxic tumours respond poorly to therapy. Pan-cancer molecular hallmarks of tumour hypoxia remain poorly understood, with limited comprehension of its associations with specific mutational processes, non-coding driver genes and evolutionary features. Here, as part of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium, which aggregated whole genome sequencing data from 2658 cancers across 38 tumour types, we quantify hypoxia in 1188 tumours spanning 27 cancer types. Elevated hypoxia associates with increased mutational load across cancer types, irrespective of underlying mutational class. The proportion of mutations attributed to several mutational signatures of unknown aetiology directly associates with the level of hypoxia, suggesting underlying mutational processes for these signatures. At the gene level, driver mutations in TP53, MYC and PTEN are enriched in hypoxic tumours, and mutations in PTEN interact with hypoxia to direct tumour evolutionary trajectories. Overall, hypoxia plays a critical role in shaping the genomic and evolutionary landscapes of cancer

    Integrative pathway enrichment analysis of multivariate omics data

    Full text link
    Multi-omics datasets represent distinct aspects of the central dogma of molecular biology. Such high-dimensional molecular profiles pose challenges to data interpretation and hypothesis generation. ActivePathways is an integrative method that discovers significantly enriched pathways across multiple datasets using statistical data fusion, rationalizes contributing evidence and highlights associated genes. As part of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium, which aggregated whole genome sequencing data from 2658 cancers across 38 tumor types, we integrated genes with coding and non-coding mutations and revealed frequently mutated pathways and additional cancer genes with infrequent mutations. We also analyzed prognostic molecular pathways by integrating genomic and transcriptomic features of 1780 breast cancers and highlighted associations with immune response and anti-apoptotic signaling. Integration of ChIP-seq and RNA-seq data for master regulators of the Hippo pathway across normal human tissues identified processes of tissue regeneration and stem cell regulation. ActivePathways is a versatile method that improves systems-level understanding of cellular organization in health and disease through integration of multiple molecular datasets and pathway annotations
    corecore